Information Structure and Pauses in a Corpus of Spoken Danish

نویسنده

  • Patrizia Paggio
چکیده

This paper describes a study in which a corpus of spoken Danish annotated with focus and topic tags was used to investigate the relation between information structure and pauses. The results show that intra-clausal pauses in the focus domain, tend to precede those words that express the property or semantic type whereby the object in focus is distinguished from other ones in the domain.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Annotating Information Structure in a Corpus of Spoken Danish

This paper presents the work done to annotate a corpus of spoken Danish with information structure tags, and describes a preliminary study in which the corpus has been used to investigate the relation between focus and intra-clausal pauses. The study indicates that the pauses that do fall within the focus domain, tend to precede property-expressing words by which the object in focus is distingu...

متن کامل

Stress, pauses, pronominal types and pronominal functions in Danish spoken data

In this paper we present a study of the relation between types of third personal singular neuter pronoun and their functions in Danish spoken data where stress information is marked so that personal and demonstrative occurrences of the pronouns can be distinguished. This study confirms that there are language specific differences in the way various types of pronoun are used to refer to abstract...

متن کامل

The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners

The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...

متن کامل

The Value Of Minimal Prosodic Information In Spoken Language Corpora

This paper reports on an investigation into representing tone unit boundaries (pauses) as well as words in a corpus of spoken English. An analysis of data from MARSEC (Machine Readable Spoken English Corpus) shows that, for professional speakers, the inclusion of this nfinimal prosodic information will lower the perplexity of a language model. The analysis is based on information theoretic tech...

متن کامل

The Segmentation of Speech

This paper reports a phenomenon supporting the hypothesis that the emergence of structure in the evolution of language was a staged process. To develop a grammatical structure it seems necessary to first have discrete constituents which can be the building blocks of a hierarchical system. By analysing observed speech we show that the development of a linear sequence of grammatical constituents ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006